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Abstract 



Understanding the emergence and evolution of multicellularity and cellular differen- 
tiation is a core problem in biology. We develop a quantitative model that shows that 
a multicellular form emerges from genetically identical unicellular ancestors when 
the compartmentalization of poorly compatible physiological processes into com- 
ponent cells of an aggregate produces a fitness advantage. This division of labour 
between the cells in the aggregate occurs spontaneously at the regulatory level due 
to mechanisms present in unicellular ancestors and does not require any genetic pre- 
disposition for a particular role in the aggregate or any orchestrated cooperative be- 
haviour of aggregate cells. Mathematically, aggregation implies an increase in the 
dimensionality of phenotype space that generates a fitness landscape with new fitness 
maxima, and in which the unicellular states of optimized metabolism become fitness 
saddle points. Evolution of multicellularity is modeled as evolution of a hereditary 
parameter, the propensity of cells to stick together, which determines the fraction 
of time a cell spends in the aggregate form. Stickiness can increase evolutionarily 
due to the fitness advantage generated by the division of labour between cells in an 
aggregate. 



2 



1 Introduction 



Life on Earth takes a myriad of different forms, and understanding the evolution of 
this complexity is one of the core problems in all of science. The origin of species and 
the diversity of ecosystems are paradigmatic representatives of evolving complexity, 
but similarly fundamental questions arise when studying the evolution of multicel- 
lularity and cell differentiation. The evolutionary transition from unicellular to mul- 
ticellular organisms is often referred to as one of the major transition in evolution 
( Maynard Smith & Szathmary (Il995 )). even though many of the requirements for 
multi celluarity probably evolved in u nicellular ancestors, thus facilitating the transi- 



tion (|Grosberg & Strathmannl (|2007|) ). The evolution of multicelluarity is character- 



ized by the integration of lower level unit s into higher level entities, and hence is a s 



sociated with a transition in individuality (Buss 



(11987b . 



Michod(1996 



1997 



2007)) 



Such transi t ions a r e thought to be based on cooperatio n between the lo wer level 



units (Buss 



(119871) . 



Mavnard Smith & Szathmarvl (U.995), 



Michodl (120071) ). and re 



cent models for the evolution of multicellu 



of labour (Gavrilets 



(l2010h . 



Rossetti et al 



arity ar e based on the concep t of division 



(2010), 



Willensdorferl (120091) ^. typically 



between some and germ cells (|Michodl (|2007D ). However, the existing models and 
explanations for the emergence of multicellularity provide only partial answers and 
raise further questions. In most models, some basic and pre-existing differentiation 
is assumed, and the circumstances under which such differentiation can be enhanced 
and stably maintained are investigated. To study the evolution of cell differentia- 
tion it is als o often assumed that undifferentiated cells already occur in multicellular 



aggregates (IGavri 



pressures (IBonnerj ( 



ets 



(120101) ). facilitated by s election on s i ze due to env i ronm ental 



19981)). such as predation fe ell ( 



the need for cooperation (IPfeiffer & Bonhoeffeil (I2000|) ). 



1985), 



Boraas et al. 



(119981) ) or 



In this paper we consider the simultaneous evolution of multicellularity and cell 
differentiation in a population of identical and undifferentiated unicells, based on the 
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idea that the emergence of multicellularity and subsequent cellular specialization are 
driven by the fitness advantages of a division of labour between cells. Such a division 
of labour need not necessarily occur in the form of soma and germ cells. Even simple, 
unicellular organisms need to perform physiological tasks that cannot be efficiently 
accomplished simultaneously by the same cell. Examples include biochemical in- 
compatibility between metabolic processes, such as between oxygenic photosyn 



thesis a n d oxygen-s e nsitive nitrogen fixation i n 



(2003a), 



Favl (11992b . 



Flores & Herrerol (|2009), 



cyanobacteria 



Rossetti et al. 



Ber man-Frank et al. 



(120101) '): motility and 



mitosis, processes which com pete for the use of the same cellular machinery, the mi- 



crotubule organizing center King (2004) )), and in general, reproduct i on and survival 



in ch allenging environment (IKirki (|2003l) . 



Nedelcu & Michodk 2006a). 



Rainey & Kerr 



(|2010|) ). Many unicellular organisms have overcome this problem by temporal segre- 
gation of incompatible activities, essentially cycling between phases dedicated solely 
to a single activity. These cycles c an be regulated by endogenous rhythmic mecha- 



Favl (1 19921V ). Other 



nism as well as by external signals (|Berman-Frank et al.\ (12003 a|) . 
cells found alternative means of limiting the detrimental effects of such incompat- 
ibility, such as introducing intracellular segregation, or limiting one activity to the 
minimum necessary for survival, or producing additional substances that chemically 
prevent the harmful interactions. 

In a multicellular organism, such incompatible processes can take place simul- 
taneously, but compartmentalized to separate cells. A first well-studied example of 
emerging intercellular separati on of poorly compatible act i vities i s the germ- soma 



specialization in volvox (IKirki (|2003|) . 



Nedelcu & Michod 



(2006a)). Somatic cells 



gather nutri e nts from the e nvironment, and provide germ cells with these nutrients 



(Bell 



dl985b, 



Koufopanoul(|l994|) . 



Solari et al. 



(2006)). The somatic cells are flagel- 



ated, and th e 



( Short et al. 



flage ll a are important for motility and transport of nutrients to the cells 



(2006), 



Solari et al. 



(|2006|) ). Flagellation and cell division are incom- 
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patible, and this fact is p robably one of the f actors promoting differentiation between 
somatic and germ cells ( Koufopanou (199^)). 

A second well-studied example is the incompatibility between photosynthesis 
and n itrogen fixation, and the reso l ution of this incompatibility in filamentous cyanobac 



tena 



Berman-Frank et al. 



(2002a), 



Favl(ll992h . 



Flores & Herrero 



(2009), 



Rossetti et al. 



(1201 On ). The key enzyme for nitrogen fixation, nitr ogenase, is sensitive to oxy- 
gen, and is thus inhibited by oxygenic photosynthesis (IFavl (|l992|V ). In filamentous 
cyanobacteria, this conflict is resolved by a spatial segregation of the two processes. 
A small proportion of the cells d ifferentiate into heterocysts that fix nitrogen and 
do not engage in photosynthesis (|Yoon & Golden! (|200l|) ). and these heterocysts ex- 
change substrates with the vegetative cells in the same filament. 

The fitness advantage of such a division of labour is an important, if not cru- 
cial, factor in the emergence and evolution of multicellularity and cell specialization. 
In fact, the unicellular ancestors often already posses the prototypes of regulatory 
mechanisms that are needed to maintain cell specialization in the multicellular forms. 
Consider an example of two incompatible processes A and B that are alternating in 
time in a single-cell organism and assume that the cell has developed a regulatory 
mechanisms that allows it to suppress the process A when the contrasting process 
B occurs. When two or more such cells come into a sufficiently close and long 
enough contact that allows them to exchange the benefits produced by these two pro- 
cesses, it may become more beneficial to end cycling in each cell and come to a 
steady state with one cell specialized in A and the other in B. At the basic cellular 
signaling level, the endogenous mechanism that drives unicellular cycling is often 
based on accumulation of the products of A or B during the active ph a se and subse 
quent depletion dur i ng the p assive phase (|Berman-Frank et al. 



(2003a), 



Kirk (2003), 



Nedelcu & Michodl (I2006a|) ). Hence, when a partner cell keeps producing the prod- 



uct A, the cell which produces B does not experience the shortage of the product 
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of A, which may bring the phase-changing mechanism to a halt. This principle can 
equally be applied to germ- soma specialization where A and B can be interpreted 
as reproduction and motility: reproductive cells may not run out of nutrients if they 
are repositioned by the soma cells of the colony to new feeding positions, while the 
soma cells do not die out as the reproductive cells keep on producing their genetically 
identical copies. 

The assumption that incompatible cellular processes suppress each other is sup- 
ported by empirical evidence. Experimental work with Volvox carteri has shown that 
cells that are destined to become reproductive suppress the expression of genes en- 



coding somatic functions, and somatic cells suppress germ cell functions (IMeissner et al. 



(Il999h . 



Tarn & Kirkl (| 1 99 . Also, gene regulatory mechanisms in unicellular ances- 



ticellularity, and contribute to 



tors can readily be co-opted during the transition to mu 
differential gene expression in somatic and germ cells (INedelcu & Michodl (1200631) ). 

Even if a unicellular form evolved to combine incompatible processes via other 
mechanisms, e.g. through the production of costly means of mediation of harm- 
ful interactions between metabolites, cells can often benefit from the opportunities 
that emerge in the multicellular form as well, e.g. when compartmentalization of 
contrasting processes in separate cells allows them to stop producing the costly me- 
diation metabolites. It is important to note that when single cells merge to form a 
two- or multi-cellular aggregate with permanently specialized cellular functions, no 
initial genetic specialization is required. Merging cells can be completely genetically 
identical, yet the symmetry of the initially unspecialized aggregated cells is broken 
spontaneously by regulatory mechanisms that function independently in each cell. 

Mathematically, the evolutionary advantage of the division of labour in aggre- 
gate forms can be viewed as the emergence of new, higher fitness maxima when 
the dimensionality of phenotype space is increased. The new fitness maxima are 
not a direct consequence of aggregation, but are based on the interaction between 
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aggregated individuals that engage in the division of labor. An increase in the dimen- 
sionality of phenotype space occurs when two or more cells couple their metabolism 
by exchanging metabolites. Because of this exchange, the fitness of an individual 
cell depends not only on the metabolic state of itself, but also on the metabolic states 
of its aggregation partners, and a maximum in low-dimensional space describing the 
physiology of a single cell may often become a saddle point (i.e., a point that is 
a fitness maximum in one direction in phenotype space, and a fitness minimum in 
another direction) when new dimensions, i.e., new cells are added. However, the 
physiological state of each cell, such as the activity level of different processes, is 
regulated only by the cell itself based on the external and internal cues. Thus, the 
new fitness maxima have to be achievable by independent regulatory adjustments of 
each cell. 

In this paper we develop a model that serves as a quantitative description of the 
qualitative discussion above. For the case of two incompatible processes, we will 
show that a sufficient condition for reaching a higher fitness state of cellular spe- 
cialization in aggregates is the existence of unicellular regulatory mechanisms that 
suppress one process when the other is active, and vice versa. More specifically, we 
derive conditions in terms of the fitness function that favour the existence of a saddle 
point, and hence the evolution of division of labour, in 2-cell aggregates. We also 
show that the fact that higher fitness maxima can be attained in aggregate forms in 
turn selects for adhesion mechanisms that allow cells to form aggregates in the first 
place. We illustrate the general arguments by choosing particular forms of fitness 
costs and benefits and consider a simplified scenario of exchange of metabolites, and 
evolution of adhesion. Thus, the results show that the emergence of multicellular- 
ity and cell differentiation can, in theory, result from the evolution of the propensity 
of cells to aggregate driven by the fitness advantage of division of labour in the ag- 
gregate forms. This advantage is due to incompatible physiological processes being 
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performed separately in dedicated specialized cells, such that the results of these 
processes are shared among the cells in the aggregate. The division of labour occurs 
spontaneously at the regulatory level due to mechanisms already present in unicellu- 
lar ancestors and does not require any genetic pre-disposition for a particular role in 
the aggregate. 

2 Model definition 

We envisage the simplest possible scenario of aggregation, the formation of a union 
of two cells, and we consider a population of cells that reproduce, die, and between 
birth and death can exist in uni-cellular or two-cellular forms. The population den- 
sities of single cells and two-cellular aggregates are denoted by n\ and n 2 . We also 
assume that the transition between uni-cellular to two-cellular forms are reversible 
and may occur a number of times during a cell's lifespan. The binding constant, 
which determines the fraction of time a cell spends in the two-cell form, is controlled 
by a heritable (genetic) parameter < a < 1, which we call the cell stickiness. The 
total rate of aggregation A i:j between cells of stickiness types and aj is then given 
by 

Aij = k+n^aJcTin^cr^aj, (1) 

where k + is an aggregation constant, which we assume to be identical for all cells, 
and ni(ai) is the population density of single cells with stickiness 07. Also, for 
simplicity we assume that rate of dissociation of an aggregate is independent on 
the stickiness of its (two) constituents. The per capita dissociation rate is denoted 
as Together, these assumptions make the fraction of time a cell spends in the 
aggregate state an increasing function of its stickiness. If a cell in an aggregate 
dies, the remaining cell becomes a single cell. If a cell in an aggregate divides, the 

8 



daughter cell is released as a free cell while the aggregate remains intact. 

To measure "fitness", we assume that cells can produce two metabolites x and 
y, and that production of these metabolites confers a benefit B(x,y) and has a cost 
C(x, y). The rate of reproduction of a cell that has metabolic rates (x, y), i.e., its 
fitness, is then determined by the functions B and C. For the single cell state, a 
simple form of the benefit function reflects the requirement that both metabolites are 
essential for the normal functioning of the cell: 



B(x,y) = xy (2) 

Any realistic cost function C should satisfy the following constrains: First, since 
metabolic rates cannot increase indefinitely, the fitness must have a maximum or 
maxima at some intermediate metabolic rates and rapidly decrease for high metabolic 
rates. Hence the cost function C should grow faster than B in any direction in the 
x — y plane. Second, to incorporate our assumption that the production of x and 
y are poorly compatible cellular processes, the cost function C should exhibit an 
"inefficiency penalty" for producing both metabolites x and y in the same cell. To 
incorporate these assumptions we consider cost functions of the form 



C(x,y) = c x x exp(y ) + c y y exp(x ). (3) 

Essentially, this form means that if only one metabolite is produced (for example, 
y = 0), the cost grows fairly slowly (algebraically), ~ x 3 , but when both metabolites 
are produced, x = y, the cost grows much faster, ~ x 3 exp(x 2 ). There are many 
forms of cost functions (e.g. algebraic ones, C(x, y) = c x x 3 (l + y 4 ) + c y y 3 (l + x 4 )) 
that satisfy these two constraints and would lead to very similar results as presented 
below. 

We note that this equat ion captures aspects of an empirically measured cost func- 



tion (Dekel & Alon 



(2005)), namely that the metabolic costs associated with express- 



ing a trait increase faster than linear with the expression level. The "inefficiency 
penalty" does not have a direct empirical basis, but represents the general principle 
of poorly compatible cellular processes. The importance of this will be explained in 
more detail below. 

Our choice of B and C leaves only 2 free parameters, c x and c y , and to facilitate 
the visualization of the fitness landscapes (see below), we assume symmetry, i.e., 
c x = c y = c. This assumption is not crucial for the results reported. 

Metabolites produced by each cell are consumed by the cell itself when it is in 
the unicellular form, or are assumed to be equally shared for consumption within a 
two-cell form. Hence the two-cell benefit function takes the form 



At the same time, each cell bears the costs of everything it produces in either single- 
or multi-cell state. Overall, fitness, i.e., the cellular rate of reproduction, is given by 
the difference between the benefits and the costs of metabolism. For a a single cell 
with metabolic rates (x, y), fitness is therefore 



On the other hand, the individual fitness of a cell i (i — 1, 2) in a two cell aggregate 
with metabolic rates {xi, x 2 , y 1: y 2 } is 



iM*l, *2, 1/1, V2) = { ~ 7^ " ^ + ^ 6XP( ^ )] - 



Note that Ri(x, y) = i?2,i(x, y, x, y), i.e. the fitness of a unicellular form is equal 
to the fitness of each cell in the two-cellular form when both cells are producing 
the same amount of metabolites x and y (when analyzing the model dynamics be- 
low, we assume a cost of stickiness. Cells in two-cell aggregates tend to have higher 
stickiness, and thus pay an additional cost). 




Ri(x, y) = xy - [cx 3 exp(y 2 ) + cy 3 exp(x 2 )]. 



(4) 



10 



Due to the symmetry of R\, and because of the cost of producing high levels of 
metabolites, the fitness of a single cell will be maximized at some intermediate level 
of metabolite production (x, y) = (x*,x*). Under certain conditions, this single-cell 
maximum becomes a saddle point for the fitness function of cells in a 2-cell aggre- 
gate. These conditions can be most easily seen by assuming complete symmetry 
between the two cells of a 2-cell aggregate, i.e., by setting x 1 = y 2 and x 2 = yi in 
©. The fitness function of a cell in a 2-cell aggregate, which a priori is a function of 
four variables, then becomes 



F(x,y) = R 2 ,i(x,y,y,x) (6) 

(i = 1, 2), i.e., a function of two variables. If the function F is restricted to the 
diagonal x = y, the single cell maximum (x*,x*) can be recovered as a maximum 
along this diagonal, and the question is under what conditions this point becomes a 
saddle point, i.e., a minimum along the anti-diagonal. 

In the Appendix it is shown that the main criterion for (x*,x*) to be a saddle 
point is 



d 2 F 

{x*,x*)<0. (7) 



dxdy 

This essentially means that the production of the two metabolites should be anti- 
synergistic in the vicinity of (x*, x*), so that the gain from producing more of both 
metabolites is less than linear. For example, this can occur with the inefficiency 
penalty assumed for the cost function ©. If this anti-synergy is strong enough, the 
single cell optimum (x*, x*) becomes a saddle point for the fitness function of cells in 
2-cell aggregates. Indeed, by construction of (14151) . there exists a range of parameter 
c for which the maximum (in {x±, x 2 , y%, y 2 } space) of the reproduction rate for a 
cell that is a part of a two-cellular complex is higher than that of a single cell. This 
is illustrated in Figure 1 for c = 1/25. 

11 



1 2340 1 234 

X X 



Figure 1: The metabolic fitness landscape for single cells (defined by Eq. ©, left 
panel) and for the corresponding two-cell aggregates (defined by Eq. ©, right 
panel). For the two-cellular fitness we have assumed an anti-symmetry between the 
metabolic states of the cells, x\ = y 2 and x 2 — y%. The form of the fitness landscape 
© implies that the maxima visible in the right panel remain the same in the unre- 
stricted 4-dimensional space {x±, y±, x 2 ,y 2 }. The figure shows how a maximum on 
the diagonal x = y of the left panel, corresponding to equal rates of production of 
x and y, becomes a saddle point for the two-cell fitness landscape in the right panel. 
Two maxima near the horizontal and vertical axes in the right panel correspond to 
compartmentalization of production of x and y in the two-cell state: while one cell 
produces only x, the other cell produces only y. The white area in the plots corre- 
sponds to negative birth rates (i.e., B — C < 0). The landscapes shown correspond 
to c x = c y = c = 1/25. 
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The left panel depicts the single-cell state fitness R\ (x, y), which has a maximum 



of a cell in a two-cell aggregate. With the assumption that we made above on the 
x — y symmetry of the cost function, the maxima of the two-cell-state fitness are in 
the subspace when two cells are exactly in anti- symmetric metabolic states, x\ = 
y 2 = x, and x 2 = Vi = V, hence they are maxima of the fitness function © (here 
pmax _ g25/432 w 1.45). One can compare the two panels by noting that the 
diagonal x = y in both of them corresponds to the same function F(x, x). But while 
for the single-cell state the two-variable fitness R\(x, y) has a global maximum along 
the diagonal x = y = x* (left panel), the two-cell state fitness F(x, y) has a saddle 
point at x = y = x* on the diagonal (right panel). For larger c this condition is no 
longer satisfied, so the two-cell state has the same diagonal fitness maximum as the 
single-cell form, and division of labour would not be expected. 

It should be noted that condition © is a local condition near the single cell max- 
imum (x*, x*), and that the cost functions © is of course only one of many such 
functions that lead to fitness functions that can satisfy this condition ©. Thus, the 
cost (and benefit) function used here merely serve to illustrate a general principle. 

To complete the basic model description, we assume that cells, whether in uni- or 
two-cellular form, reproduce individually by periodically releasing unicellular off- 
springs at a rate that is proportional to the fitness of a cell. The only cell property 
that is inherited during reproduction is the stickiness a. A small random variation in 
a occurring during reproduction corresponds to mutations in this genetically deter- 
mined trait. Finally, we assume a logistic form of the per cell death rate Dp, which 
is independent of whether a cell is single or a part of an aggregate, 



at the diagonal x = y = x* with R 



<max 



1 



0.866. The right panel shows the fitness 




(8) 



where 5 is a parameter and N is the total population. 
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3 Model dynamics 

There are three biologically distinct time scales in our model: the fast regulatory 
metabolic adjustment, the intermediate rate of cellular aggregation and dissociation, 
and slow reproduction and concomitant evolution of the heritable trait a. This natu- 
ral time scale separation allows us to simplify the mathematical analysis, assuming 
steady states of the faster processes in the dynamics of the slower events. 

3.1 Metabolic regulation 

So far we have specified benefits and costs of producing two metabolites at rates x 
and y, but we have not specified what controls the dynamics of metabolic regulation 
of x and y. The basic assumption is that each cell adjusts the rates of production of the 
metabolites to maximize fitness for given conditions (unicellular or aggregate form) 
via a fast regulatory mechanism. The important part of this assumption is that each 
cell acts individually to maximize its fitness, without "coordinating" the metabolic 
regulation with its partner, yet the conditions in which a cell operates do depend on 
whether or not the cell is in aggregate form, and if it is, on the metabolic state of 
the partner. We first assume that in the single cell state, each cell has a naturally 
occurring mechanism for regulating its metabolite production to the optimum of the 
fitness landscape given by B(x,y) — C(x,y), with B and C the cost functions |2] 
and [3] The fitness landscape in metabolic space defined by Eq. © and illustrated in 
Figure 1 has a fairly simple form, so it seems reasonable to assume the regulatory 
convergence to the unicellular fitness maximum. 

For example, one can assume that the cells adjust the metabolic states following 
the gradient in the fitness landscape with some random noise r], 
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(r] zi (t)} = 0, (Vzi(t)v Z j(t')) = 5 itj 5(t - t')T 



(9) 



Here Z{ is one of the two metabolic coordinates {x, y}, i] zi (t) is a random noise term 
with zero mean that is assumed uncorrelated in time and between different coordi- 
nates. A process given by © conver ges to a steady sta te distribution of the cell 



population in "metabolic space" n(z) (Ivan Kampen 



09921)), 



n(z) = no exp 



2a „ . . 



(10) 



where z = (x, y) are again the metabolic coordinates in the unicellular state. We 
assume that the dynamics is fast (large a) and the noise is weak (small T), so that 
the population quickly becomes concentrated in the vicinity of the metabolic state 
conferring maximum fitness. 

We assume that the same regulatory mechanisms that lead to metabolic fitness 
maximization in the unicellular state regulate the metabolic rates of two cells in ag- 
gregate form to a metabolic fitness maximum for the two-cell aggregate (essentially, 
the noise term in the metabolic dynamics © leads to symmetry breaking, enabling 
two cells that just aggregated to diverge in their metabolic phenotypes). In fact, the 
dynamics given by ©, applied to z% being one of the four coordinates {x±, yi,x 2 , 2/2}, 
also lead to a concentration of the metabolic rates [x\ 1 y±,x 2 , y 2 ) of two aggregated 
cells in the vicinity of a fitness maximum. The corresponding fitness landscape is 
illustrated in Figure 1 (right panel). The Figure illustrates that the fitness maximum 
of a cell in a two-cell aggregate can be higher than the metabolic fitness maximum 
attained by a single cell. Essentially, this is because the fitness maximum of a sin- 
gle cell, which lies on the diagonal x = y, becomes a saddle point in the higher- 
dimensional metabolic fitness landscape of the aggregate form. 
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In the following, we therefore assume that each cell is at a metabolic state that 
maximizes its reproductive rate. This allows us to drop the metabolic coordinates 
from the notation for the cell concentration. For example, for c x = c y = 1/25, the 
(steady state) reproduction rate of a single cell becomes R\ ~ 0.866 (maximum on 
the diagonal of the left panel of Fig. 1) and the reproduction rate of a cell in a two- 
cell state R 2 = 625/432 1.45 (corresponding to the maxima in the right panel of 
Fig. 1) 

3.2 Transition from unicellular to multicellular states 

The kinetics of the association of two cells with stickiness <j\ and a 2 into a two-cell 
complex with concentration n c (cr 1 , cr 2 ) is described by the following rate equation, 

— c ^' 2 = k + n 1 (ai)a 1 n 1 (a 2 )a2 - k_n c (a 1 ,a 2 ) - 28Nn c {ai,a 2 ). (11) 

Here the gain or association term has the form introduced in ©, the first loss term 
describes the dissociation of a two-cell complex into individual cells and the second 
loss term describes the loss of a two-cell complex due to a death of one of its con- 
stituents as defined in ®. The factor 2 reflects the fact that a death of any of two 
cells is sufficient for the elimination of a two-cell aggregate. Assuming sufficiently 
fast association and dissociation compared to the time scale of the evolution of stick- 
iness, we calculate the steady state concentrations of two-cell complexes for a given 
density of single cells, 

*/ \ k + n x {a 1 )a 1 nx{a 2 )a 2 
n ^^ = k~+26N • (12) 

To determine the steady state concentration n 2 (a) of individual cells in aggre- 
gates, the concentration of complexes n*{a 1: a 2 ) has to be integrated over one of its 
coordinates, 
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n* 2 {a)=2 n* c {a,a')da' = m{a)x (13) 
Jo 

2k + a J Q °° ni(a')a'da' 
X = k_ + 2SN ' 

The factor 2 reflects the fact that either of two cells in an aggregate can have the 
stickiness a. Note that \ needs to be determined self-consistently since the quantity 
N in the denominator depends on the total number of cells, N — N x + N 2 , Nj = 

Jnj{a)da. 

A rate equation that is analogous to (fTTT) holds for the evolution of the single-cell 
population: 



f) ( \ poo 

ni } a) = - 2k + n l (a)a / nAa'^a'da' + k„n 2 (a) + 5Nn 2 (a) - SNnAa) 
dt Jo 

+ [R 2 n 2 (a) + Rmxia)} M(a) (14) 

Here the first loss term denotes the association of a single cell a into an aggregate 
with any other single cell (with the factor 2 describing that two single cells are lost), 
the first gain term describes the dissociation of an aggregate, the second gain term 
describes the appearance of a single cell when a cell in an aggregate dies, and the 
second loss term corresponds to the loss of a single cell due to its death. The gain 
term in the third line accounts for the appearance of a new single cell due to cell di- 
vision, which occurs within aggregated and free cells with rates R 2 and R x , defined 
by eqs. © and ©. The term M(o), a decreasing function of a, accounts for the 
fitness costs of maintaining stickiness. This term imposes a penalty for two-cell ag- 
gregates; aggregation can thus only evolve if it provides a non-negligible advantage 
to the aggregating cells. 
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Figure 2: Evolution towards cell aggregation and two-cellularity. Panel a) shows 
the population distribution of the trait a over time, with brighter areas indicating 
higher densities. Panel b) shows the proportion of 2-cell aggregates, N 2 / (Nx + N 2 ) 
with Nj = J rij{a)da for j = 1, 2 as a function of time. For the figure, eq. (fl~5T) 
was solved numerically for the following parameters: k + = 10, fc_ = 1,6 = 1, 
M(x) = 1 — x/10, and D = 1CT 3 . As mentioned earlier, the effective birth rates 
for unicellular and two-cellular forms were assumed to be perfectly optimized to the 
corresponding cellular state, Ri = 0.866 and R 2 = 1.45. 

3.3 Evolution of stickiness 

Substituting (fl~3l) into (fl4l) we arrive at the equation for the evolution of the popula- 
tion density of cells with stickiness a, 



The diffusion term, proportional to a small constant D and added to the birth term, 
describes mutational variation in stickiness at birth. Eq. (fT51) can be solved numer- 
ically. For suitable parameter combinations, the population evolves towards higher 
stickiness, resulting in cells spending most of their life in two-cell aggregates. This 
process is illustrated in Figure 2 and is due to the higher birth rates and resulting 



drii(a) 



5N ni (a)(l + x) 



(15) 



+ M(a)(R 2X + Ri) n x {a)+D 



3a 2 
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evolutionary advantage of cells that are more sticky and thus spend more time in a 
state that is metabolically superior due to division of labour. 



4 Discussion 

The evolution of cell differentiation in multicellular aggregates is an important tran- 
sition in the history of life on Earth. Most existing models of this transition as- 
sume some pre-existing differentiation in the single cells and/or t he pre-existenc e 



of some form of compartmentalization, i.e. , multicellularity (e.g 



Gavrilets 



Rossetti et al. 



(l2010h . 



Willensdorferl (|2009)). The main result of the present study 



is a proof of principle that multicellularity and cell specialization can emerge in ge- 
netically and physiologically homogenous populations via spontaneous breaking of 
cellular universality (or symmetry) by regulatory non-hereditary metabolic mecha- 
nisms. 

Such symmetry breaking occurs if the division of labour between cells brings cer- 
tain fitness advantages and if regulatory mechanisms that allow cells to optimize their 
physiology exist in the ancestral unicellular form. It is important to note that with 
such mechanisms, the cells adjust their regulatory state individually, in a "selfish" 
way, thus no assumption about special cooperative interactions between the cells in 
an aggregate are necessary. Essentially, a suitable fitness landscape, exhibiting higher 
fitness for cells with differentiated functions in the aggregate form, determines the 
path of regulatory optimization towards cellular specialization. The prerequisite for 
this is that the 4-dimensional fitness landscape of a 2-cell aggregate has higher peaks 
than the 2-dimensional fitness landscape of a single cell, reflecting the advantages 
of division of labour. In other words, the maximum of the 2-dimensional landscape 
turns into a saddle point in the 4-dimensional landscape. This qualitatively defines 
the general properties of the fitness function that promote (or inhibit) the transition to 
multicellularity, and can be formalized by considering second derivatives of fitness 
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functions (see Appendix), which reveals that the main criterion for the maximum 
of the 2-dimensional landscape to turn into a saddle point in the 4-dimensional land- 
scape is that there should be anti- synergistic interaction between the two metabolites, 
so that an increase in the production of both metabolites in a single cell has a suffi- 
ciently high inefficiency penalty. The cost functions used in this paper provide one 
example of such an inefficiency penalty. 

As a consequence of metabolic inefficiency, an effective increase in the dimen- 
sionality of the physiological pathways due to aggregation enables the cells to attain 
higher fitness in the aggregate form by dividing the labour of producing the two 
metabolites. If stickiness, i.e., the tendency to form aggregates, is a trait under selec- 
tion, and if the costs of stickiness is not too large, then the increase in dimensionality 
of the fitness landscape and the concomitant increase in physiological fitness leads to 
the evolution of more sticky cells, resulting in the emergence of multicellular! ty and 
cellular specialization within the aggregates. We note that this mechanism for the 
evolution of aggregation of single cells into multicellular clusters is different from 



the c 



size (Bell 



assic hypo t hesis tha t such aggregation is driven by some form of selec tion for 



41985b, 



Bonnerl (119981)'). e.g. due to preda tion (IBoraas et al. 



need for cooperation (|Pfeiffer & Bonhoefferi (|2000f) ). 



(1 1998b ) or the 



In virtually all models for cell differentiation, the basic underlying mechanism is 
a tradeoff between different physiological functions. It is then usually assumed that, 



for unspecified reasons, there exist different cell types that either a 



Rossetti et al. 



Gavrilets 



(2010), 



ready occupy 



Willensdorfer 



Lfl) 



(12010 )). Subsequently, 



different locations on the tradeoff curve (e.g. 
(|2009|) ). or have the genetic potential to do so (e.g 
the optimal composition of the different cell types in cell aggregates is studied. In 
contrast, in our model, all cells are in principle physiologically identical, and the 
differentiation only manifests itself through a purely physiological regulatory mech- 
anism once cells occur in aggregates. In essence, the symmetry breaking regu- 
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latory mechanism generates a permanent spatial differentiation in cell aggregates. 
Such a physiological crystallization of potential temporal differentiation of single 
cells has been envisioned as one of the main routes to multicellular differentiation 



(INedelcu & Michodl (I2006b|) ). and our model can serve as a basic metaphor for this 
process. 

In most accounts, the basic physiological tradeoff underlying the transition to 
differentiated multicellularit y is between rep r oduction and viability, and hence b e- 



tween soma and germ cells (|Gavriletsl (|2010f) . 



Kirk (2003), 



Michod (2006 



2007)) 



The model presented here can also be viewed in that context. Then the physiological 
variables x and y become traits describing reproductive productivity (number of off- 
spring) and viability (probability to survive to reproduction). The latter depends on 
many factors, such as the abil ity of cells t o move, which in some types of organisms 
is incompatible with mitosis (|Kingl (|2004l) ). In a germ-soma specialization scenario, 
the regulatory mechanisms relevant for symmetry breaking may be based on a re- 
spons e to s i gnals to stop growth that are emitted by fully developed bigger germ 
cells (IKirkl (|2003l) ). Such signals could arrest the development of pro-soma cells, 
rendering them sterile. The predisposition for initial size and subsequent germ-soma 
differentiation would then stem from spontaneous asymmetric cell division (rather 
than from genetic differences). 

To critically evaluate the plausibility of our model for the evolution of multi- 
cellularity, it will be essential to test the main assumptions and predictions exper- 
imentally. The most critical assumptions of the model are that i) some important 
cellular processes cannot be performed well in the same cell, ii) cells can readily 
evolve increased levels of attachment, and iii) attached cells can complement each 
other metabolically, and thus specialize on one of two poorly compatible processes. 
The first assumption, about trade-offs betw een cellular process es, if fundamental to 
most models of metabolic specialization (|Gudelj et all (|2010|) ). A number of re- 
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cent studies have estab 



offs (IBeg et al. 



Scott et al. 



(2007), 



ished concrete molecular mec 



lanisms that can lead to trade- 



Carlsonl (120071) . 



Knight et al. 



(2006), 



Molenaar et al. 



(2009), 



(|2010l) ). and it will be interesting to test whether such trade-offs are perva- 
sive between different types of metabolic processes, and in many different organisms. 
If they are, this would increase the plausibility of the evolutionary transition towards 
multicellulary proposed here. 

Testing the predictions of our model is challenging, but seems possible in prin- 
ciple. The main prediction is that conditions in which important cellular processes 
are incompatible with each other will promote the evolution of increased levels of 
attachment between complementary cell types. It is worthwhile considering whether 
this prediction can be tested with evolutionary experiments in the laboratory. As dis- 
cussed above, oxygenic photosynthesis and nitrogen fixation are prime examples of 



incompatible processes (Berman-Frank et al. 



(|2003bl) ). and unicellular cyanobacteria 



separate these processes temporarily, by performing photosynthesis during daytime 
and nitrogen fixation at night. One possible direct test of our model would be to 
evolve unicellular cyanobacteria in the laboratory under conditions where both pro- 
cesses are expected to be active i.e., in continuous light in medium without fixed 
nitrogen and ask whether the bacteria evolve adhesion and exchange of fixed com- 
pounds between cells that perform different processes. It is worth noting that the 
evolution of stickiness, i.e., of unicell ular organisms form ing multicellular clusters, 



has recently been observed in yeast (IRatcliff et al. 



(|201l|) ). although the importance 



of incompatible metabolic process for this phenomenon remains to be determined. 

In conclusion, in this paper we present a model showing that multicellularity and 
cellular differentiation can develop when cells can form an aggregate that enables 
them to exchange chemical signals and metabolites. This aggregate essentially has 
a higher physiological dimension, so that when there are cellular processes that are 
incompatible in a single cell, segregation of these processes into separate cells is pos- 
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sible in the aggregate form. Regulatory mechanisms that can control such a division 
of labour within an aggregate can be expected in many ancestral unicellular forms 
and are based on signals coming either from the cell itself, or from partner cells in 
the aggregate environment. The resulting division of labour can generate fitness ben- 
efits that lead to selection on the propensity of cells to aggregate, and hence to form 
multicellular and differentiated organisms. 
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5 Appendix 

As explained in the text, the symmetry between x and y, which follows from the form 
of the fitness functions ©, allows for a reduction of the dimension of metabolic 
space from four to two. Accordingly, we consider the symmetric fitness function 
F(x, y) = R.2,i(x, y, y, x), i = 1,2, where the i? 2 ,i are the fitness functions © of a 
single cell in a 2-cell aggregate. Then the restriction of F to the diagonal x = y is 

g(z) = F(z,z), (16) 
and the restriction of F to the anti-diagonal through the point (x*, x*) is 

h(z) = F(z,2x*-z). (17) 
Along the diagonal, (x*, x*) is a fitness maximum by assumption, hence 



gf(zV) = + 2^( X 'X) + < 0. (18) 



For (x*, x*) to be a saddle point of F, it must be a fitness minimum along the 
anti-diagonal, hence we must have 



f) 2 h rl 2 F r) 2 F r) 2 F 

^-(**,x*) = W**,**) - 2^— (x*,x*) + ^y(x*,x*) > 0. (19) 
oz A ox z oxoy oy z 

It is clear that this last inequality inequality is satisfied if ^£^{x* %*) is negative 
enough. We note that (fT9~l) also tends to be satisfied if the pure second derivatives of 
F are positive at (x*, x*), but this also tends to violate the condition (fT8b for (x*, x*) 
to be a maximum along the diagonal. If symmetry between x and y is not assumed, 
similar considerations lead to analogous criteria in terms of second derivatives of 
fitness functions for a maximum in 2-dimensional to become a saddle point in 4- 
dimensional space. 
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